20 research outputs found

    Reverse Polarization in Amino acid and Nucleotide Substitution Patterns Between Human–Mouse Orthologs of Two Compositional Extrema

    Get PDF
    Genome-wide analysis of sequence divergence patterns in 12 024 human–mouse orthologous pairs reveals, for the first time, that the trends in nucleotide and amino acid substitutions in orthologs of high and low GC composition are highly asymmetric and polarized to opposite directions. The entire dataset has been divided into three groups on the basis of the GC content at third codon sites of human genes: high, medium, and low. High-GC orthologs exhibit significant bias in favor of the replacements, Thr → Ala, Ser → Ala, Val → Ala, Lys → Arg, Asn → Ser, Ile → Val etc., from mouse to human, whereas in low-GC orthologs, the reverse trends prevail. In general, in the high-GC group, residues encoded by A/U-rich codons of mouse proteins tend to be replaced by the residues encoded by relatively G/C-rich codons in their human orthologs, whereas the opposite trend is observed among the low-GC orthologous pairs. The medium-GC group shares some trends with high-GC group and some with low-GC group. The only significant trend common in all groups of orthologs, irrespective of their GC bias, is (Asp)Mouse → (Glu)Human replacement. At the nucleotide level, high-GC orthologs have undergone a large excess of (A/T)Mouse → (G/C)Human substitutions over (G/C)Mouse → (A/T)Human at each codon position, whereas for low-GC orthologs, the reverse is true

    Distinct, ecotype-specific genome and proteome signatures in the marine cyanobacteria Prochlorococcus

    Get PDF
    The marine cyanobacterium Prochlorococcus marinus, having multiple ecotypes of distinct genotypic/ phenotypic traits and being the first documented example of genome shrinkage in free-living organisms, offers an ideal system for studying niche-driven molecular micro-diversity in closely related microbes. The present study,through an extensive comparative analysis of various genomic/proteomic features of 6 high light (HL) and 6 low light (LL) adapted strains, makes an attempt to identify molecular determinants associated with their vertical niche partitioning. Pronounced strand-specific asymmetry in synonymous codon usage is observed exclusively in LL strains. Distinct dinucleotide abundance profiles are exhibited by 2 LL strains with larger genomes and G+C-content ≈ 50% (group LLa), 4 LL strains having reduced genomes and G+C-content ≈ 35-37% (group LLb), and 6 HL strains. Taking into account the emergence of LLa, LLb and HL strains (based on 16S rRNA phylogeny), a gradual increase in average aromaticity, pI values and beta- & coil-forming propensities and a decrease in mean hydrophobicity, instability indices and helix-forming propensities of core proteins are observed. Greater variations in orthologous gene repertoire are found between LLa and LLb strains, while higher number of positively selected genes exist between LL and HL strains. Strains of different Prochlorococcus groups are characterized by distinct compositional, physicochemical and structural traits that are not mere remnants of a continuous genetic drift, but are potential outcomes of a grand scheme of niche-oriented stepwise diversification, that might have driven them chronologically towards greater stability/fidelity and invoked upon them a special ability to inhabit diverse oceanic environments

    Molecular signature of hypersaline adaptation: insights from genome and proteome composition of halophilic prokaryotes

    Get PDF
    A comparative genomic and proteomic study of halophilic and non-halophilic prokaryotes identifies specific genomic and proteomic features typical of halophilic species that are independent from genomic GC-content and taxonomic position

    Oligonucleotide Frequencies of Barcoding Loci Can Discriminate Species across Kingdoms

    Get PDF
    Background: DNA barcoding refers to the use of short DNA sequences for rapid identification of species. Genetic distance or character attributes of a particular barcode locus discriminate the species. We report an efficient approach to analyze short sequence data for discrimination between species. Methodology and Principal Findings: A new approach, Oligonucleotide Frequency Range (OFR) of barcode loci for species discrimination is proposed. OFR of the loci that discriminates between species was characteristic of a species, i.e., the maxima and minima within a species did not overlap with that of other species. We compared the species resolution ability of different barcode loci using p-distance, Euclidean distance of oligonucleotide frequencies, nucleotide-character based approach and OFR method. The species resolution by OFR was either higher or comparable to the other methods. A short fragment of 126 bp of internal transcribed spacer region in ribosomal RNA gene was sufficient to discriminate a majority of the species using OFR. Conclusions/Significance: Oligonucleotide frequency range of a barcode locus can discriminate between species. Ability to discriminate species using very short DNA fragments may have wider applications in forensic and conservation studies

    Molecular association of glucose-6- phosphate isomerase and pyruvate kinase M2 with glyceraldehyde-3-phosphate dehydrogenase in cancer cells

    Get PDF
    Background: For a long time cancer cells are known for increased uptake of glucose and its metabolization through glycolysis. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a key regulatory enzyme of this pathway and can produce ATP through oxidative level of phosphorylation. Previously, we reported that GAPDH purified from a variety of malignant tissues, but not from normal tissues, was strongly inactivated by a normal metabolite, methylglyoxal (MG).Molecular mechanism behind MG mediated GAPDH inhibition in cancer cells is not well understood. Methods: GAPDH was purified from Ehrlich ascites carcinoma (EAC) cells based on its enzymatic activity. GAPDH associated proteins in EAC cells and 3-methylcholanthrene (3MC) induced mouse tumor tissue were detected by mass spectrometry analysis and immunoprecipitation (IP) experiment, respectively. Interacting domains of GAPDH and its associated proteins were assessed by in silico molecular docking analysis. Mechanism of MG mediated GAPDH inactivation in cancer cells was evaluated by measuring enzyme activity, Circular dichroism (CD) spectroscopy, IP and mass spectrometry analyses. Result: Here, we report that GAPDH is associated with glucose-6-phosphate isomerase (GPI) and pyruvate kinase M2 (PKM2) in Ehrlich ascites carcinoma (EAC) cells and also in 3-methylcholanthrene (3MC) induced mouse tumor tissue. Molecular docking analyses suggest C-terminal domain preference for the interaction between GAPDH and GPI. However, both C and N termini of PKM2 might be interacting with the C terminal domain of GAPDH. Expression of both PKM2 and GPI is increased in 3MC induced tumor compared with the normal tissue. In presence of 1 mM MG,association of GAPDH with PKM2 or GPI is not perturbed, but the enzymatic activity of GAPDH is reduced to 26.8 ± 5 % in 3MC induced tumor and 57.8 ± 2.3 % in EAC cells. Treatment of MG to purified GAPDH complex leads to glycation at R399 residue of PKM2 only, and changes the secondary structure of the protein complex. Conclusion: PKM2 may regulate the enzymatic activity of GAPDH. Increased enzymatic activity of GAPDH in tumor cells may be attributed to its association with PKM2 and GPI. Association of GAPDH with PKM2 and GPI could be a signature for cancer cells. Glycation at R399 of PKM2 and changes in the secondary structure of GAPDH complex could be one of the mechanisms by which GAPDH activity is inhibited in tumor cells by MG

    Transcriptomic and metabolomic shifts in rice roots in response to Cr (VI) stress

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Widespread use of chromium (Cr) contaminated fields due to careless and inappropriate management practices of effluent discharge, mostly from industries related to metallurgy, electroplating, production of paints and pigments, tanning, and wood preservation elevates its concentration in surface soil and eventually into rice plants and grains. In spite of many previous studies having been conducted on the effects of chromium stress, the precise molecular mechanisms related to both the effects of chromium phytotoxicity, the defense reactions of plants against chromium exposure as well as translocation and accumulation in rice remain poorly understood.</p> <p>Results</p> <p>Detailed analysis of genome-wide transcriptome profiling in rice root is reported here, following Cr-plant interaction. Such studies are important for the identification of genes responsible for tolerance, accumulation and defense response in plants with respect to Cr stress. Rice root metabolome analysis was also carried out to relate differential transcriptome data to biological processes affected by Cr (VI) stress in rice. To check whether the Cr-specific motifs were indeed significantly over represented in the promoter regions of Cr-responsive genes, occurrence of these motifs in whole genome sequence was carried out. In the background of whole genome, the lift value for these 14 and 13 motifs was significantly high in the test dataset. Though no functional role has been assigned to any of the motifs, but all of these are present as promoter motifs in the Database of orthologus promoters.</p> <p>Conclusion</p> <p>These findings clearly suggest that a complex network of regulatory pathways modulates Cr-response of rice. The integrated matrix of both transcriptome and metabolome data after suitable normalization and initial calculations provided us a visual picture of the correlations between components. Predominance of different motifs in the subsets of genes suggests the involvement of motif-specific transcription modulating proteins in Cr stress response of rice.</p

    Universal Plant DNA Barcode Loci May Not Work in Complex Groups: A Case Study with Indian Berberis Species

    Get PDF
    BACKGROUND: The concept of DNA barcoding for species identification has gained considerable momentum in animals because of fairly successful species identification using cytochrome oxidase I (COI). In plants, matK and rbcL have been proposed as standard barcodes. However, barcoding in complex genera is a challenging task. METHODOLOGY AND PRINCIPAL FINDINGS: We investigated the species discriminatory power of four reportedly most promising plant DNA barcoding loci (one from nuclear genome--ITS, and three from plastid genome--trnH-psbA, rbcL and matK) in species of Indian Berberis L. (Berberidaceae) and two other genera, Ficus L. (Moraceae) and Gossypium L. (Malvaceae). Berberis species were delineated using morphological characters. These characters resulted in a well resolved species tree. Applying both nucleotide distance and nucleotide character-based approaches, we found that none of the loci, either singly or in combinations, could discriminate the species of Berberis. ITS resolved all the tested species of Ficus and Gossypium and trnH-psbA resolved 82% of the tested species in Ficus. The highly regarded matK and rbcL could not resolve all the species. Finally, we employed amplified fragment length polymorphism test in species of Berberis to determine their relationships. Using ten primer pair combinations in AFLP, the data demonstrated incomplete species resolution. Further, AFLP analysis showed that there was a tendency of the Berberis accessions to cluster according to their geographic origin rather than species affiliation. CONCLUSIONS/SIGNIFICANCE: We reconfirm the earlier reports that the concept of universal barcode in plants may not work in a number of genera. Our results also suggest that the matK and rbcL, recommended as universal barcode loci for plants, may not work in all the genera of land plants. Morphological, geographical and molecular data analyses of Indian species of Berberis suggest probable reticulate evolution and thus barcode markers may not work in this case

    Design, Development and Implementation of Novel in silico Data Mining, Clustering and Visualization Tools for Comparative Genome and Proteome Analysis

    No full text
    If one recited the human genome sequence at the rate of 5 bases per second for 24 hours a day, it would take one about 19 years to recite the book of life. This is only taking into account 3 billion bases of the genome sequence itself, excluding data annotations and other information associated with sequence data. As of April 15, 2012, GenBank release 189.0 has 139,266,481,398 bases from 151,824,421 reported sequences - nearly 47 times the human genome sequence content (GenBank release note - ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt). The number of sequenced genomes is presently growing at an unprecedented pace and there should be no reason to expect it to slow down in the future. On the contrary, the introduction of genomics at a massive scale with „second-generation sequencing technologies‟ such as 454, Solexa or Solid adds to this expectation (Mardis 2008), as illustrated by the recent genome sequencing of a single individual. Importantly, these new technologies promise to be very useful for genome analysis in non-model organisms (Ellegren 2008; Hudson 2008; Vera et al. 2008). Moreover, the 1000 genomes project will create a new map of genetic variation for our genome. Other projects are helping to catalogue genes involved in cancer, alternative splicing in different tissues and transcription factor binding, for example. The evolution of „omic‟ science through microarray transcriptomics, metabolomics, proteomics, etc. is generating this huge data. This data have begun to revolutionize genomics and their effects are becoming increasingly widespread. There has been a vital need to warehouse, harness, disseminate, analyze and interpret this torrents of data, not only to quench the academic thirst of the researchers, but also for medical diagnostic and therapeutic uses and several other biotechnological applications. For this, use of computational approaches was inevitable and thus emerged Bioinformatics – a new field of scientific enquiry that represents a confluence of biology, computer science and information technology
    corecore